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Abstract 

We study sparse spikes deconvolution over the space of Radon mea¬ 
sures on R or T when the input measure is a finite sum of positive Dirac 
masses using the BLASSO convex program. We focus on the recovery 
properties of the support and the amplitudes of the initial measure in the 
presence of noise as a function of the minimum separation t of the input 
measure (the minimum distance between two spikes). We show that when 
w/\, and are small enough (where A is the regulariza¬ 

tion parameter, w the noise and N the number of spikes), which corre¬ 
sponds roughly to a sufficient signal-to-noise ratio and a noise level small 
enough with respect to the minimum separation, there exists a unique 
solution to the BLASSO program with exactly the same number of spikes 
as the original measure. We show that the amplitudes and positions of 
the spikes of the solution both converge toward those of the input measure 
when the noise and the regularization parameter drops to zero faster than 

1 Introduction 

1.1 Super-Resolution and Sparse Spikes Deconvolution 

Super-resolution consists in retrieving fine scale details of a possibly noisy 
signal from coarse scale information. The importance of recovering the high 
frequencies of a signal comes from the fact that there is often a physical blur in 
the acquisition process, such as diffraction in optical systems, wave reflection in 
seismic imaging or spikes recording from neuronal activity. 

In resolution theory, the two-point resolution criterion defines the ability of 
a system to resolve two points of equal intensities. As a point source produces 
a diffraction pattern which is centered about the geometrical image point of the 
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point source, it is often admitted that two points are resolved by a system if the 
central maximum of the intensity diffraction of one point source coincides with 
the first zero of the intensity diffraction pattern of the other point. This dehnes 
a distance that only depends on the system and which is called the Rayleigh 
Length. In the case of the ideal low-pass hlter (meaning that the input signal is 
convolved with the Dirichlet kernel, see (9) for the exact definition) with cutoff 
frequency fc, the Rayleigh Length is l//c- We refer to [9] for more details 
about resolution theory. Super-resolution in signal processing thus consists in 
developing techniques that enable to retrieve information below the Rayleigh 
Length. 

Let us introduce in a more formal way the problem which will be the core 
of this article. Let X be the real line M or the 1-D torus T = M/Z and Ai{X) 
the Banach space of bounded Radon measures on X, which can be seen as the 
topological dual of the space ‘^x where ‘rfx is either the space of continuous 
functions on M that vanish at inhnity when df = K or the space of continuous 
functions on T when X = T. We consider a given integral operator <!> : Al(df) —)■ 
"H, where "H is a separable Hilbert space, whose kernel (p is supposed to be a 
smooth function (see Definition 1 for the technical assumptions made on (p), i.e. 

ymGM{X), = / p{x)dm{x). (1) 

Jx 

$ represents the acquisition operator and can for instance account for a blur in 
the measurements. In the special case of p{x) = ip{- — x), $ is a convolution 
operator. We denote by rnao,tzo = J2i=i <^o,iStzo,i our input sparse spikes train 
where the aop € are the amplitudes of the Dirac masses at positions tzg^i G 
X. Let yt = d>TOao,t 2 o be the noiseless observation. The parameter t > 0 controls 
the minimum separation distance between the spikes, and this paper aims at 
studying the recovery of niao^tzo from yt + w (where w G H is some noise) when 
t is small. 

1.2 Prom the LASSO to the BLASSO 

LASSO. regularization techniques were first introduced in geophysics (see 
[6, 17, 20]) for seismic prospecting. Indeed, the density changes in the un¬ 
derground can be modeled as a sparse spikes train. reconstruction property 
provides solutions with few nonzero coefficients and can be solved efficiently with 
convex optimization methods. Donoho theoretically studied and justihed these 
techniques in [11]. In statistics, the £^ norm is used in the Lasso method [23] 
which consists in minimizing a quadratic error subject to a £^ penalization. As 
the authors remarked it, it retains both the features of subset selection (by set¬ 
ting to zero some coefficients, thanks to the property of the £^ norm to favor 
sparse solutions) and ridge regression (by shrinking the other coefficients). In 
signal processing, the basis pursuit method [22] uses the £^ norm to decompose 
signals into overcomplete dictionaries. 
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BLASSO. Following recent works (see for instance [1, 3, 5, 8, 12]), the sparse 
deconvolution method that we consider in this article operates over a continuous 
domain, i.e. without resorting to some sort of discretization on a grid. The in¬ 
verse problem is solved over the space of Radon measures which is a non-reflexive 
Banach space. This continuous “grid free” setting makes the mathematical anal¬ 
ysis easier and allows us to make precise statement about the location of the 
recovered spikes locations. 

The technique that we study in this paper consists in solving a convex opti¬ 
mization problem that uses the total variation norm which is the equivalent of 
the norm for measures. The norm is known to be particularly well fitted 
for the recovery of sparse signals. Thus the use of the total variation norm 
favors the emergence of spikes in the solution. 

The total variation norm is defined by 

VmeAf(A), |m|(A) =■ sup \f V’dm ; ||V'||l~(x) < 1 

In particular, 

|Wao,tzol(^) “ ll^olli ) 

which shows in a way that the total variation norm generalizes the norm to 
the continuous setting of measures (i.e. no discretization grid is required). 

The first method that we are interested in is the classical basis pursuit, 
defined originally in [22] in a finite dimensional setting, and written here over 
the space of Radon measures 

min {\m\{X); = yt} . {'Po{yt)) 

meMiX) 

This is the problem studied in [5], in the case where is an ideal low-pass filter 
on the torus (i.e. A = T). 

When the signal is noisy, i.e. when we observe yt + w instead, with w & 
we may rather consider the problem 

min \\\^m-{yt+w)\\l^+\\m\{X). {Vx{yt+w)) 

Here A > 0 is a parameter that should adapted to the noise level ||wll.j^. This 
problem is coined “BLASSO” in [8]. 

While this is not the focus of this article, we note that there exist algorithms 
to solve the infinite dimensional convex problems (Voiyt)) and {V\{yt + w)), see 
for instance [3, 5]. 

1.3 Previous Works 

LASSO/BLASSO performance analysis. In order to quantify the recov¬ 
ery performance of the methods Voiyt) and V\{yt +w), the following two ques¬ 
tions arise: 


3 



1. Does the solutions of Vo{yt) recover the input measure mao,tzo ? 

2. How close is the solution of V\{yt + w) to the solution of Vo{yt) ? 

When the amplitudes of the spikes are arbitrary complex numbers, the an¬ 
swers to the above questions require a large enough minimum separation dis¬ 
tance A(tzo) between the spikes where 

A(teo) =' min dxitzoy, tzoj). (2) 

where dx is either the canonical distance on M i.e. 

yx,yGR, dx{x,y) = \x - y\, (3) 

when A = M, or the canonical induced distance on T i.e. 

dx{x + I^Ty +'E) = unn\x — y + k\, (4) 

when A = T. The first question is addressed in [5] where the authors showed, 
in the case of $ being the ideal low-pass filter on the torus (see (9)), i.e. when 
"H = T, that TOao,tzo the unique solution of Voijjt) provided that A(tzo) ^ ^ 
where C > 0 is a universal constant and fc the cutoff frequency of the ideal 
low-pass filter. In the same paper, it is shown that C ^ 2 when oq G and 

C ^ 1.87 when oq G In [12], the authors show that necessarily C ^ 

The second question receives partial answers in [14, 3, 4, 13]. In [3], it is 
shown that if the solution of ’Po(yt) is unique then the measures recovered by 
'Pxiyt + w) converge in the weak-* sense to the solution of Vo{yt) when A —)■ 0 
and ||w||.j^ /A —> 0. In [4], the authors measure the reconstruction error using 
the norm of an ideal low-pass filtered version of the recovered measures. 
In [14, 13], error bounds are given on the locations of the recovered spikes with 
respect to those of the input measure mao,tzo- However, those works provide 
little information about the structure of the measures recovered by 'P\(yt + w). 
That point is addressed in [12] where the authors show that under the Non 
Degenerate Source Condition (see Section 2 for more details), there exists a 
unique solution to V\{yt + w) with the exact same number of spikes as the 
original measure provided that A and ||rc||^/A are small enough. Moreover 
in that regime, this solution converges to the original measure when the noise 
drops to zero. 

LASSO/BLASSO for positive spikes. For positive spikes (i.e. ao,i > 0), 
the picture is radically different, since exact recovery of niaoyzo without noise 
(i.e. {w,X) = (0,0)) holds for all f > 0, see for instance [8]. Stability con¬ 
stants however explode as t —> 0. A recent work [19] shows however that stable 
recovery is obtained if the signal-to-noise ratio grows faster than 0{l/t'^^), 
closely matching optimal lower bounds of 0{l/t'^^~^) obtained by combinato¬ 
rial methods, as also proved recently [10]. Our main contribution is to show 
that the same 0{l/t‘^^~^) signal-to-noise scaling in fact guarantees a perfect 
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support recovery of the spikes under a certain non-degeneracy condition on the 
filter. This extends, for positive measures, the initial results of [12] by providing 
an asymptotic analysis when t —)■ 0. 

MUSIC and related methods. There is a large body of literature in signal 
processing on spectral methods to perform spikes location from low frequency 
measurements. One of the most popular methods is MUSIC (for Multiple Signal 
Classification) [21] and we refer to [15] for an overview of its use in signal process¬ 
ing for line spectral estimation. In the noiseless case, exact reconstruction of the 
initial signal is guaranteed as long as there are enough observations compared 
to the number of distinct frequencies [18]. Stability to noise is known to hold 
under a minimum separation distance similar to the one of the BLASSO [18]. 
However, on sharp contrast with the behavior of the BLASSO, numerical obser¬ 
vations (see for instance [7]), as well as a recent work of Demanet and Nguyen, 
show that this stability continues to hold regardless of the sign of the ampli¬ 
tudes Ooy, as soon as the signal-to-noise ratio scales like Note that 

this matches (when w is a Gaussian white noise) the Cramer-Rao lower bound 
achievable by any unbiased estimator [2]. This class of methods are thus more 
efficient than BLASSO for arbitrary measures, but they are restricted to oper¬ 
ators d) that are convolution with a low-pass filter, which is not the case of our 
analysis for the BLASSO. 

1.4 Contributions 

Main results. From these previous works, one can ask whether exact support 
estimation by BLASSO for positive spikes is achievable when t tends to 0. Our 
main result. Theorem 2, shows that this is indeed the case. It states, under some 
non-degeneracy condition on $, that there exists a unique solution to 'P\(yt + w) 
with the exact same number of spikes as the original measure provided that 
j|w;j|.j^/A, \\w\\-^ and are small enough. Moreover we give an 

upper bound, in that regime, on the error of the recovered measure with respect 
to the initial measure. As a by-product, we show that the amplitudes and 
positions of the spikes of the solution both converge towards those of the input 
measure when the noise and the regularization parameter tend to zero faster 
than 

Extensions. We consider in this article the case where all the spikes locations 
tzQ cluster near zero. Following for instance [19], it is possible to consider a 
more general model with several cluster points, where the sign of the Diracs 
is the same around each of these points. Our analysis, although more difficult 
to perform, could be extended to this setting, at the price of modifying the 
definition of ryw (see Definition 3) to account for several cluster points. 

Lastly, if the kernel $ under consideration has some specific scale a (such 
as the standard deviation of a Gaussian kernel, or ct = l//c for the ideal low- 
pass filter in the case of the deconvolution on the torus), then it is possible to 
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state our contribution by replacing t by the dimensionless quantity SRF '*=' tja 
(called “super-resolution factor” in [19]). It is then possible to extend our proof 
so show that the signal-to-noise ratio should obey the scaling 1/SRF^^“^. 

Roadmap. The exact statement of Theorem 2 (our main contribution), and 
in particular the definition of the non-degeneracy condition, requires some more 
background, which is the subject of Section 2. The proof of this result can 
be found in Sections 3, 4 and 5. It relies on an independent study of the 
asymptotic behavior of quantities depending on the operator ^tzo (such as its 
pseudo-inverse) when t tends to zero. This takes place in Section 3. Note that 
a sketch of proof of the main result can be found in Section 2.3. 

1.5 Notations 

Measures. We consider If = M or X = T as the space of Dirac masses 
positions. X equipped with the distance dx (see (3) and (4)) is a locally compact 
metric space (compact in the case If = T). We denote by M{X) the space 
of bounded Radon measures on If. It is the topological dual of the Banach 
space '^x (endowed with IHIloo^j;-)) of continuous functions defined on X, that 
furthermore are imposed to vanish at inhnity in the case If = K. The two 
problems that we study, i.e. the Basis Pursuit for measures Voivt) and the 
BLASSO V\{yt+w), are two convex optimization problems on the space A4(Ar). 

Let us consider oq G (M^)'^ and Zq G When AT = T, we make the 
assumption that zq G (—|, |)^. We define 

Ao = A(zo), (5) 

where A(zo) is introduced in (2). We denote by B(x, r) (resp. B(x, r)) the closed 
(resp. open) £°° ball in with center x and radius r. We define neighborhoods 
of respectively zq and oq as 

^ def, Ao\ ^ d,t. min*(ao,*)\ , . 

Szo=B(zo,—j and Sao=B(ao, - 4 ~^ ) ' 

Note that when AT = T, if z G Bzq then z G (—|, 5)^ and for all t G (0,1], 
tz G (—|, ^)^. As a result, depending on the context, we consider tzo and tz 
as elements of by identifying them with their equivalent classes. 

Then, the initial measure we want to recover is of the form 

N 

^aQ^ZQ — ^ ^ O’O^jdtzQ i ■ 
i=l 

Note that for example, in the case AT = T, when we write Stzo i ; tzo^i is considered 
as an element of T. 

We use the parameter t G (0,1] to make the spikes locations tend to 0 in X, 
so that the minimum separation distance of mao,tzo ■ A(tzo) —> 0 when t —)■ 0. 
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Kernels. The admissible kernels Lp •. X ^ % defining the integral operators $ 
(modeling the blur of acquisition) are functions, defined on X and taking values 
in a separable Hilbert space "H, satisfying some regularity properties listed in 
the following Definition. 

Definition 1 (Admissible kernels). We denote by KER^, the set of admissible 
kernels of order k. A function ip : X H belongs to KER^ ‘P & 'W^{X,'H). 
When A = M, (/? must also satisfies the following requirements: 

• For all p G H, {p{x),p)-^ —> 0 when \x\ -G +oo. 

• For alio ^i ^ k, sup ||(/?^*^(a;)||.j, < +oo. 

xen 

Linear operators. We consider a linear operator <!> : Ai{X) —> H of the form 

Vm e At(A), <f>m ‘^= f p{x)dm{x), (7) 

Jx 

where p : X ^ FL belongs to KER°. $ is weak-* to weak continuous. Its adjoint 
$* ; "H is given by 

Vp gfl, yxG X, (<i>*p)(x) = {p{x),p)^. 

A typical example is a convolution operator, where p{x) = p[- — x), for 
some continuous function : A —)• M, p € L^(A) n ‘ifx, so that (<I>m)(u) = 
p(v — x)dm{x). In particular, in the case A = M, the Gaussian filter is 
defined as 

Vx e M, p{x) = paix) =' e~^ (8) 

where pc is the Gaussian kernel. In the case A = T, a typical example of 
convolution operator is the ideal low pass filter which is defined as 

fa 

VxGT, p{x) = pd{x)^^- Y. (9) 

k = -fa 

for some fc G N* called the cutoff frequency, pu is the Dirichlet kernel with 
cutoff frequency fc- 

Remark I. This last example is equivalently obtained (as considered for instance 
in [5]) by using FL = (endowed with its canonical inner product) in place 

oiFL = L^(T), and defining p as p{x) = Note that, to simplify 

the notation, we consider in this paper real Hilbert spaces "H, but our analysis 
readily extends to complex Hilbert spaces. 

Given general p and $ as in (7), and given x = {xi,... ,xn) G X^, we 
denote by $2 : —>■ FL the linear operator such that 

N 

Va G d>s(a) =' ^ atp{xi), 
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and by Fg : x M^) —>■ % the linear operator defined by 

Fx k) =' {aj'pi.Xi) + bjip'jxj)). 

For ip G KER^, operators involving the derivatives are defined similarly, 

N 

i=l 

We occasionally write (resp. for (resp. for and we 

adopt the following matricial notation 

$x = ((^(cci) ... ip{xN)) and Fx = ($x $f), 

where the “columns” of those “matrices” are elements of "H. In particular, the 
adjoint operator : "H —)■ is given by 

Vpe-H, 

We denote by (pk GH the derivative of ip at 0, z.e. 

(10) 

In particular, ip^ = (/j(0). 

Given fc G N, we define 

^'fc =' (t/^o ^1 ■■■ ^k) ■ (11) 

If —)■ % has full column rank, we define its pseudo-inverse as 'I'^ ‘^=' 

(5'fc'I'fc)“^5'fc. Similarly, we denote FJ = (FJFx)“^F| provided Fx has full 
column rank. 


Linear algebra. For 2 ; G we let 


/ 1 

■21 

R, =■ 


1 0 

Zjv 1 


0 

1 




e K21 Vx21V^ (^2) 


\ (2Ar-l)! 


(2Ar-l)! (2Ar-2)! 


(2N-2)\ / 


SO that is the matrix of the Hermite interpolation at points zi,...zn 

when IR 2 Ar-i [-^] is equiped with the basis • 

For each TV G N, we define 


5n = (1,0,...,0)^gM'^, 
liv = (1,1,...,!)^ gM"^. 
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We use the norm, for vectors of or whereas the notation 
II'll refers to an operator norm (on matrices, or bounded linear operators) ■ IMI« 
is the norm on H associated to the inner product IHIl“(x) denotes the 

norm for functions defined on X. 

2 Asymptotic Analysis of Support Recovery 

This section exposes our main contribution (Theorem 2). The hypotheses 
of this result require an injectivity condition and a non-degeneracy condition, 
that are explained in Sections 2.1 and 2.2. 

2.1 Injectivity Hypotheses 

We introduce here an injectivity hypothesis which ensures the invertibility 
of ‘bj'z'htz and Tl^Vtz for t > 0 small enough. 

In the case of the ideal low-pass filter (defined in (9)), Tj has full column 
rank provided that x = {xi,... ,xn) € X^ has pairwise distinct coordinates 
(see [12], Section 3.6, Proposition 6). That property is not true for a general 
operator $. However, in this paper we focus on sums of Dirac masses that 
are clustered around the point 0 G X, i.e. a; = tz for t > 0 and z € with 
pairwise distinct components. The following assumption, which is crucial to our 
analysis, shall ensure that T^^ has full rank at least for small t. 

Definition 2. Let ip : X ^ H. For all k € N, we say that the hypothesis Ik 
holds if and only if 

if G KER* and ((po,... ,ipk) are linearly independent in FL. {Ik) 

See Definition 1 for the definition of the space KER^ and Equation (10) for 
the definition of (pk- 

lilk holds, then is a symmetric positive definite matrix, where 4'fc is 

defined in (11). 

To exemplify the meaning of this injectivity hypothesis. Proposition 1 below 
considers the case V = T with $ a convolution operator. 

Proposition 1. Let ip G “^^(T, M) (where (p{x) = (p{- — x)), then Ik holds if 
and only if (po has at least k + 1 non-zeros Fourier coefficients. Ln particular if 
$ is the ideal low-pass filter with cutoff frequency fcGW, Ik holds if and only 
ifk^ 2/c. 

The proof of this proposition is given in Section A.l. 

As we shall see in Section 3, the conditions In-i and I 2 N -1 imply respec¬ 
tively the invertibility of and Pj^Ptz, provided that t is small enough. 

According to Proposition 1, in the special case of an ideal low-pass filter, these 
conditions holds if and only if fc is large enough with respect to the number N 
of spikes. 
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2.2 Vanishing Derivatives Precertificate 

Following [12] (Section 4.1, Definition 6), we introduce below the so called 
“vanishing derivatives pre-certificate” rjy t, which is a function defined on X 
that interpolates the spikes positions and signs (here -1-1). Note that r]v,t can 
be computed in closed form by solving the linear system (15). 

Proposition 2 (Vanishing derivatives precertificate, [12]). has full col¬ 

umn rank, there is a unique solution to the problem 

inf{lbll« ;Vi = l,...,V, (4>*p)(teo.O = l,(^’»'(^ 2 oy) = 0}. 

Its solution pv,t is given by 

pv,t = r+’; (Y) ’ 

and we define the vanishing derivatives precertificate as r]v,t "= ^*Pv,t- 

As shown in [12] (see Section 4.2), rjy t governs the support recovery prop¬ 
erties of the BLASSO V\{yt -f w). More precisely, if rfy^t is non-degenerate, 
i.e. 

e X\{tzo^i,...,tzo^N}, |w.t(a;)| < 1, 

I \/i€ N}, r7(^_i(tzo,*) 7^ 0, 

then there exists a low noise regime in which the BLASSO recovers exactly the 
correct number N of spikes, and the error on the locations and amplitudes is 
proportional to the noise level. 

The constants involved in the main result of [12] (Theorem 2) depend on the 
value of t > 0. The goal of this paper is to precisely determine this dependency, 
and to show that this support recovery result extends to the setting where t —)■ 0, 
provided that (A, w) obey some precise scaling with respect to t. 

Since our focus is actually on the support recovery properties of the BLASSO 
when t —)• 0, it is natural to look at the limit of py^t as t —)■ 0. This leads us to 
the {2N — l)-vanishing derivatives precertificate defined below. 

Propositions ((2V — l)-vanishing derivatives precertificate). //I 2 V -1 holds, 
there is a unique solution to the problem 

inf {i|p 1|„ ; (<I>*p)(0) = 1, (<I>*p)'(0) = 0,..., (0) = o} . 

We denote by pw its solution, given by 

PW = '^tN-1^2N (17) 

and we define the {2N — l)-vanishing derivatives precertificate as ijw ”= ^*Pw- 

The following Proposition, which is a direct consequence of Lemma 1 in the 
next section, shows that indeed py^t converges toward pw- 
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t = 0.4 i = 0.2 t = 0.01 


Figure 1: Top row: r]v,t for several values of t, showing the convergence toward 
r]iY- The operator $ is an ideal low-pass filter with a cutoff frequency fc = 10 
(see (9)). 



N = 1 {r]v,t =Vw) N = 2 N = i 

Figure 2: pw for several values of N. The operator $ is an ideal low-pass filter 
with a cutoff frequency fc = 10 (see (9)) 


Proposition 4. IfT 2 N-i holds and ip G KER'^ (for K ^ 2N — IJ, then for 
t > 0 small enough Fi^g has full column rank. Moreover 

lim pv t = Pw strongly in TL , 

(-> 0 + 

lim viy \ = ni^ in the sense of the uniform convergence on X, 
for all 0 ^ k ^ K. 

Figure 1 shows graphically this convergence of rjv.t toward rjw in the case of 
the deconvolution problem over the 1-D torus with the Dirichlet kernel. Figure 2 
and 3 show for several values of N. Notice how it becomes flatter at 0 as 
N increases. This implies that r]v,t for small t gets closer to degeneracy as N 
increases. This is reflected in our main contribution (Theorem 2) where the 
signal-to-noise ratio is required to scale with . 

The behavior of rjy^t is therefore governed by specific properties of 77 ^ for 
small values of t > 0. In particular, as stated by Theorem 1 below, the non¬ 
degeneracy of rjw (as defined in Definition 3 below) implies the non-degeneracy 
of rjv^t (as defined in (16)). 

Definition 3. Assume that T 2 N -1 holds and ip G KER^^. We say that rjw 
{2N — 1)-non-degenerate if r]^^\o) 0 and for all x G X \ {0}, \riwix)\ < 1. 

Theorem 1. Suppose that pw is {2N — 1)-non-degenerate (Definition 3). Let 
Rw > 0. Then, there exist Cw > 0, tw > 0 such that for all t G {0,tw), 
all z G with pairwise distinct coordinates and \z\^ ^ Rw, cind all p G 
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<^2Ar(^) p satisfying for I ^ i ^ N, rjitzi) = 1 and ri'(tzi) = 0, 


V^e {0,...27V}, 


- Vw 


L“(X) 


< Cw 


& X \ |? 7 (a;)| < 1 and VI < i < -/V, rj'ftzi) < 0^ . 


The proof of this theorem can be found in Section A.2. 

Remark 2. An important consequence of Theorem 1 is that if rjw is (2iV — 1)- 
non-degenerate, then, by Proposition 4, riv,t is also non-degenerate for t > 0 
small enough. The function riv,t is then the minimal norm certificate for the 
measure mao,tza, and the Non-Degenerate Source Condition (see [12]) holds. 
As a result, for fixed small t > 0, the BLASSO admits a unique solution in a 
certain low noise regime corresponding to a large enough signal to noise ratio, 
with exactly the same number of spikes as the original measure niaoUzo- 
more details on that matter, see [12]. 

A natural question is whether r]w is indeed (21V— l)-non-degenerate. Propo¬ 
sition 5 below (proved in Section A.3) gives a partial answer in the case of the 
deconvolution over X. 


Proposition 5. Assume that ^ is a convolution operator (i.e. for all x & X, 
(p{x) = (p{- — x) and H = Lf(X)) and I 2 N holds. Suppose also, only in the 
case A = M, that for all 0 ^ i ^ 2N — I, (p^'^\x) —)■ 0 when jxj —>■ -foo. Then 

Remark 3. Thanks to Proposition 5 and the first part of the proof of Theorem 1 
(which is given in Section A.2), note that the following is true: there exist 
Cw > 0, tw > 0 such that for all t G {0,tw), z € with pairwise distinct 
coordinates and jzl ^ Rwi there exists r+ > 0 with r+ > max twZi and 

r~ < 0 with r~ < min twZi such that for all 77 € (X) n W^^’°°(A) 

satisfying for all 1 < z < A, ri{tZi) = 1 and rj'{tzi) = 0: 


Vf e {0,...,2A}, 


- Tlw 


L“(X) 


< Cw 


e (r , r+) \ |J{teJ, l? 7 (a;)l < 1 and Vz G {1,..., A}, Z 7 "(tei) < 0^ . 


Whether the other condition in the definition of the (2A— 1)-non-degeneracy 
of rjw holds (i.e. whether \rjw\ < 1 on A \ {0}) should be checked on a case- 
by-case basis. Since pw depends only on the kernel (p and can be computed 
by simply inverting a linear system, it is easy to check numerically if pw is 
(2A— l)-non-degenerate. Proposition 6 shows that in the special case of A = M 
and the Gaussian kernel, pw can be computed in closed form and is indeed 
(2 A — l)-non-degenerate. 
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Proposition 6. Assume that X = 'R, % = L^(K) and ^ is a convolution 
operator, i.e. for all x G M, 'fix) = ipi- — x), where : x G M i—>■ e““ is a 
Gaussian. Then the associated 12N — 1)-vanishing derivatives precertificate is 


Vx G M, 


_ Gi 

Vw{x) = e 4 


E 


2^' 


(18) 


In particular, pw is {2N — 1)-non-degenerate. 

The proof of this result can be found in the Appendix A.4. If we denote by 
rjwcn the (27V — 1) vanishing derivatives precertihcate associated to the filter 

_ x2 

ipcr '. X a R ^ e then rjw.G = ’7w,i(y)- That is why we only consider the 
case of cr = 1 in Proposition 6. Figure 3 shows rjw for the Gaussian filter with 
an increasing number N of spikes. 

For the deconvolution over the 1-D torus, we observed numerically (as illus¬ 
trated in Figure 2) that pw is (2N — l)-non degenerate for the ideal low pass 
filter for any value N such that N ^ fc (Figure 4 represents the complementary 
case where N is fixed but fc increases). However, for some filters, the associated 
might be degenerate. This is illustrated in Figure 5 where pw is illustrated 
for several filters with increasing complexity i.e. we consider low pass filters with 
a fixed cutoff frequency, with increasing extreme Fourier coefficients (starting 
with a slowly varying filter). Remark that the last two ijw (in red) are degen¬ 
erate, as they correspond to the filters with the higher complexity (the Fourier 
coefficients increase the most with the frequency). 





'N = 7' 

_ x2 

Figure 3: rjw for the Gaussian filter (x G K, fix) = e ^) for several numbers 
of spikes and cr = 1. ryw is i2N — 1)-non-degenerate. It gets flatter at 0 when 
the number of spikes increases. 







|y\/VW'/wwvwN 


/c = 3ff “ 

Figure 4: rjw for the ideal low pass filter as fc increases, for N = 2. 
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Figure 5: rjw for a low pass filter for /c = 10 with increasing high frequency 
content. First row: rjw- Second row: associated Fourier coefficients of the filter. 
The curve showing r]\Y is in blue when it is (2N — 1)-non-degenerate and in red 
when it is degenerate. 

2.3 Main Contribution 

We now state our main contribution. 

Theorem 2. Suppose that ip G and that r]w is (27V — l)-non- 

degenerate. Then there exist constants (ti,t\Y, C, Cn, M) (depending only on p, 
oq and zq ) such that for allO < t < min {ti,tw), for all (A, w) G B(0, Cjit^^~^) 
with Ilf 11^ < C, 

• the problem V\{yt + w) admits a unique solution, 

• that solution has exactly N spikes, and it is of the form nriauz, with (a, z) = 
gf{X,w) (where gl is a function defined onB{0,Ciit'^^~^) C M.xH), 

• the following inequality holds 

\{a,z) - (ao,zo)L ^ ^ + |St) ' 

Note that the value of constants involved can be found in the proof of the 
theorem, more precisely, {ti,C) are defined in (46), Cr is defined in Proposi¬ 
tion 9, tw is defined in Theorem 1 and M is given in Corollary 1. 

The proof of Theorem 2 uses results spanning Sections 3, 4 and 5. Below, we 
give a sketch of proof to guide the reader through the remaining of the paper. 
The elements of the proof are divided in three main steps. 

Step 1 (Section 4.1 and 4.2). We start with the first order optimality 
equation that any solution ma^z, for fixed t > 0, of Vx{yt + w) must satisfy i.e. 

r:, ($t,a - -w) + X = 0. 

It is obtained by applying Fermat’s rule to the problem V\{yt + w). Since the 
parameters (a, z, A, w) = (oq, zq, Or, 0^^) are a solution of the equation, the idea 
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is to parametrize, in a neighborhood of {X,w) = (0r, 0>^), the amplitudes and 
positions (a, z) in terms of (A, w) by applying the implicit function theorem so 
that (a, z, A, w) is a solution of the first order equation. This process is detailed 
in Section 4.1 and 4.2. The rest of the proof consists in proving that the measure 
Wa,tz is the unique solution of the problem Vx{yt + w). But before we have to 
deal with the domain of existence of the above parametrization. 

Step 2 (Section 4.3). The implicit function theorem only provides the exis¬ 
tence of a neighborhood in {X,w) of (0 r, 0^) where the parametrization holds, 
but we do not know how it size depends on the parameter t. This issue is impor¬ 
tant because one of our aims is to determine the constraints on t (corresponding 
roughly to the minimum distance between the spikes of the original measure), 
on the noise level and on the regularization parameter A so that the recovery of 
the support is possible. Section 4.3 is devoted to show that the parametrization, 
which writes (a, z) = gl (A, w) (see Equation (38) for the definition of the im¬ 
plicit function ), of the solutions of the hrst order optimality equation holds 
in a neighborhood of (0^,0^^) and of size proportional to This result 

corresponds to Proposition 9. The proof uses an upper bound of Ag^ which is 
stated in Corollary 1. Proposition 9 relies on asymptotic expansions (of Ttz for 
example), when t —)■ 0, gathered and used in the proof of Lemma 5. Section 3 is 
devoted to these asymptotic expansions and it may be skipped at first reading. 

Step 3 (Section 5). Up to now, we have constructed a candidate solution 
TTia,tz (composed of N spikes) where (a, z) = g^ (A, w) is built by parametrizing 
the solutions of the first order optimality equation. Moreover this parametriza¬ 
tion holds for all {X,w) in a ball of radius proportional to It remains 

to prove that ma,tz is indeed the unique solution of 'P\{yt + w). To prove that 
'ma,tz is a solution, it is equivalent to check that 

0 e (9 i ||$m -yt- wWli^ + X\m\{X)^ {ma,tz), 

which reformulates into ==’ ^4>*(yt -|- w — ^ma,tz) € d\ma,tz\iX)- This is 
done by first showing the convergence of ryAy towards ijw when (t. A, w) —?• 0 
in a well chosen domain, see Proposition 11, and then using Theorem 1 and 
the fact that r/w is ensured to be {2N — l)-non-degenerate (which is one of the 
hypotheses of Theorem 2) to get the non-degeneracy of and the conclusion. 

Putting all together. After this sketch, we now give the detailed proof. It 
uses Proposition 9 (parametrization of the solution of the first order optimality 
equation on a ball, for the parameter {X,w), of radius proportional to 
Proposition 11 (convergence of rjx^t towards ryw), Theorem 1 (use of the {2N—1)- 
non-degeneracy of ijw to transfer it to ry^y), and Proposition 10 (upper bound 
on the error of ma,tz with respect to Wao.tzo)- 
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Proof of Theorem 2. Let us take t, A, w as in the hypotheses of the Theorem 2. 
Let (a, z) = (A,w), where gl is the function constructed in Section 4. Let us 

define 

P\,t '= ^ ($tzo«o + w - ^tzo) and rix,t = ^*Px,t- 
By Proposition 11 combined with Theorem 1 where we take 

Rw = sup{|z|^ ; z e (19) 

we have for 0 < t < min(tiy, ti), 

Va; e a: \ |J{<zJ, |? 7 A,t(a;)| < 1 and \/l ^ i ^ N, ri'f ^^tzi) < 0, (20) 

i 

while r]\^t{tZi) = 1 = sign(ai) by definition. 

We deduce that ijx^t is in the subdifferential of the total variation at ma,tz 
because 

• Vx,t G '^x, 

• II^A,t|lL°°(x) ^ ^ thanks to Equation 20, 

• VI < z < TV, rix,t{tZi) = 1 = sign(ai) by definition of rjx^t (recall that 
(a,z) = gt{X,w)). 

As a result ma,tz is a solution to Vx{yt + w) and px,t is the unique solution to 
the dual problem associated to Vx{yt + w) (see Section 2.4 of [12] for details on 
dual certificates and optimality conditions for Vx{yt +w)). 

Let m be an other solution of Vxiyt + w). Then the support of m is included 
in the saturation points of r]x,t = ^*Px,t be. in {tzi,... ,tZN}. As a result 
w = ma\tz for some a' G and m satisfies the hrst order optimality equation 

Lj^ {^tza' - ^tzgao -w) + X = 0- Hence and since 

has full rank (by assumption t is chosen sufficiently small, see Lemma 1 for 
the proof), is invertible and a' = a. So m = ma,tz and Vxiyt+w) admits 

a unique solution: ma,tz- 

The bound on the error between (a, z) and the amplitudes and positions of 
the initial measure (ao,zo) is a direct consequence of Proposition 10. □ 

2.4 Necessary condition for the recovery in the limit t —)■ 0 

Our main contribution. Theorem 2, states that under a non-degeneracy prop¬ 
erty which involves rjw^ it is possible to perform the recovery of the support of 
a measure mag^tzo ia th® limit t —)■ 0 when the data are contaminated by some 
noise, provided that max(|A|/t^^“^, ||w||.j^ ||zc||.j^ /A) < C for some con¬ 

stant C > 0 depending only on the filter ip and (ao,zo). It is natural to ask 
whether the non-degeneracy condition on rjwj ia order to get the recovery of 
the support in some low noise regime, is sharp. 
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The following Theorem shows that the {2N — l)-non-degeneracy assumption 
on Tjw is almost sharp in the sense that the recovery of the support in a low 
noise regime leads to ||??w|Il“(x) ^ 

Theorem 3. Suppose that I 2 N -1 holds and (p € Suppose also that 

there exists a sequence such that —>■ 0 and satisfying 

Vn e N,3(A„,r(;„),3(a„,z„) € x is solution ofV\„{yt^ + Wn), 

where (A„,w„) — )■ 0 with Then 

llwllL“(jf) ^ 1- (21) 

The proof of this result can be found in Appendix A.5. 

The remaining sections of the paper, namely Sections 3, 4 and 5 are devoted 
to the proof of Theorem 2. 


3 Preliminaries 

Our study relies to a large extent on the asymptotic behavior of quantities 
built upon and Tj^ for t > 0 small, such as or (Tj^Tt^)”^. In 

this section, we gather several preliminary results that enable us to control that 
behavior. 


Approximate Factorizations. Our asymptotic estimates are based on an 
approximate factorization of and T x using Vandermonde and Hermite matri¬ 
ces. It enables us to study the asymptotic behavior of the optimality conditions 
of V\{yt + w) when t —> 0+. In the following, we consider z G (see (6)) and 
t G (0,1], so that JItz is always invertible. Moreover, we shall always assume 
that (p e KER2^. 

Proposition 7. The following expansion holds 


Tiz — '^2N-lHtz + ■^t,zDt, 

where T 2 Ar-i is defined in (11), Htz is defined in (12), and where 
A ‘Ifl- 1 a2JV,„(2JV)^„z„ \ (1 “ 


( 22 ) 


( J p^'^^^stZi)- 
( J iZif^~^p‘^‘^^^stZi) 


■ rl s 

(2fV—1)! J is^is^N 

(1 - s)2^-2 


—d.s 

(2fV — 2 )! J Is^is^N 


Dt = diag(t 


2N 
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Proof. This expansion is nothing but the Taylor expansions for ip and p': 

p{tZi) = po + itZi)pi + . . . + i), V?2jV-l 

/•I (^ _ 

+ {tZif^ 1 )! 

p'ftzi) = pi + {tZi)p2 + ... + (2jV - 2)! '^^^"^ 

+ {tZif^-^ ds. (24) 

□ 


The above expansion yields a useful factorization for Vtz, 

where '^tz =' ^27V-i + K,zDtH~J^. 


The rest of this section is devoted to the consequence of that factorization for 
the asymptotic behavior of Ft^ and its related quantities. The main ingredient 
of this analysis is the factorization of Ht^ as 


Htz = diag(l, diag 1,..., 1, 


(25) 


Let us emphasize that our Taylor expansions are uniform in z € Bz^- More 
precisely, given two quantities f{z, f), g{z, t), we say that f{z, f) = g{z, t)+0(t^) 
if 

\f{z,t) - g{z,t)\ 


limsup sup 

t-^o+ zeB^n 




< + 00 . 


Lemma 1. The following expansion holds for t —> O'*", 

-^tz = ■^2N-i+0(t). 


(26) 


Moreover, ifl 2 N-i holds then '^tz andVtz have full column rank fort > 0 small 
enough and 

= («'2W-i^2Ar-i)-^ + 0{t) (27) 

^tz’* + Oit). (28) 

Proof. We begin by noticing that 

Kt.zDtHf^,^ = f^At,zHf^ diag(l, 1/t,..., 1/f^-^) 

= At,zHf^diag{t^^,f^-\...,t). 
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The function 2 ; !->• H~^ is " 10 °° and uniformly bounded on and {z, t) !->■ At^z 
is 'rf^ on the compact set Bzq x [0,1] hence uniformly bounded too. As a result, 
we get (26). 

Assume now that I 2 N -1 holds. Since 'I' 2 Ar-i'^' 2 Ar-i is invertible, there is 
some R > 0 such that for every A in the closed ball B('I' 2 Ar-i'^' 2 Ar-i, .R) C 
^( 2 N-i)x{ 2 N-i)^ jg invertible. By the mean value inequality 

||(^ 2 JV- 1 ^ 2 Ar-l)”^ - A-l|| ^ _ sup \\B-\A - 

^ sup 

Applying that to A = = 'I' 2 jv-i^ 2 Ar-i + 0{t) (since each term in the 

product is uniformly bounded), we get (27). 

Now, for the last point, we infer from Ttz = "^tzHiz and the fact that Htz is 
invertible that T)'),’* = Hence 

(V) = = '^tzm.'^tzr^s^N 

where d 2 N is defined in (13). 

Each factor below being uniformly bounded in Bzq x [0,1], we get 

^tz’* = ('I' 2 Ar-l + Oit)) + Oit)) 62 N 

= 'I'2Ar-l('I'2Ar_id^2Ar-l) ^52N+0{t). 


\B 


-i| 


^2Ar-1^2Ar-l 


□ 


Projectors. In this paragraph, we shall always suppose that I 2 N -1 holds. 
Another important quantity in our study is the orthogonal projector P(inirt^)-L 
(resp. P(im^ 2 jv_i)i) onto (IrnTtz)^ (resp. (Im^' 2 Ar_i)-^). We define 

n*. = = Id« - rt,(rLrt,)-irL, 

n2Ar-l == R’(Im«'2N-i)^ = - 'l'2Ar-l('l'2JV-l'E'2Ar-l)~^'I'2Ar-l- 

Observing that P(imrtj)J- = > we immediately obtain from the previous 

Lemma that litz = n 2 jv-i + 0{t). 

By construction, Btz^tz = Atz^'tz = 0) but the following proposition shows 
that this quantity is also small if we replace ^tz with . 

Lemma 2. There exists a constant Li > 0 (which only depends on ip, ao and 
Zq) such that 

uniformly in z & Bzg ■ 
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Proof. Applying a Taylor expansion to , we write 


where 


Hence 


Vt. = 


/ 0 
0 
1 


\ (2Ar-3)! 


0 \ 
0 
1 


{2N-3)\ ) 




-ds 


(29) 


nt,$" = + {-^N-i - -^tzWtz + t^^-^At,z) 

= ^tz{-At,zDtH~^Vt^ + since Utz'i'tz = 0. 


Using (25), we see that 


DtHtJ^Vtz = ^ diag(t^^,t' 


2N J.2N-1 




= t 


2n-2h-1v hence 


= t 


2N-2 


Utz{-At,zHf^Vz + At,^)- 


Since ||ntz|| < 1 and the continuous function {z,t) i-> —At^zH^ ^Vz + At^z is 
uniformly bounded on the compact set Bzo x [0,1], we obtain 


in 


tz^tzlln ^ 


sup 

At' ,z' Hzi^Vz' + At'^z' 

\(z',t')eB.oX[0,l] 



J.2N-2 


H 


□ 


We study further the projector litz when it is not evaluated at the same 2 : 
as Ttz- 

Lemma 3. If (p G then there is a constant L 2 > 0 (which only 

depends on p, ag and Zg) such that for all z € Bzg, all t e (0,1], 


lAtzUtzo 



< L 2 t^^ jz 
n 


^Oloo 


Proof. Let us observe that 


^tz^tzQ {'I’2N-lHtzo + At^zoDt) 

= litzi'I’tzHtzo + ('I'2Ar-l — '^tz)Htzo + At^zoDt) 

= litzi—At^zDtHtz^Htzo + At^zoDt) since Utz^tz = 0- 
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Observing that 

diag (1,..., 1, l/t,..., 1/t) = 


we get 

^tz^tzo = Iltz (Aj 2p(Id2Ar — ^Hzg) + {-^t,zo ~ ^t,z)H^ Df. 

For k e {2A^ — 1,2A^}, the function {u,s,t) ^ ‘p^'^^\stu) is defined and 

on the compact sets (since € KER^'^"*'^) 


V1 < i < iV, 



ZOi + 



X [0,1] X [0,1], 


where Ag is defined in (5). Thus there is a constant C > 0 (which does not 
depend on t nor z G Bz^) such that 



hence 


(1-s) 


k-1 


-d5 


< c jz,: 


l!At..„-At,,ll <Clz. 


-^0 z I t 

^Oloo 


As a result, since Ullt^jj < 1 and z^ ^Hz^ is bounded on Bz^, 

\\Btz{htzo-^t,z)H~'^Hza\\^^C sup \\H~^Hzo\\\z- Zq\^. 

As for the left term, Ktzo is bounded uniformly in t G [0,1], and the mapping 
z i-G H~^Hzg is on Bz^- As a result, there is a constant C > 0 such that 

yzeBzo, \\ldN - H~^Hzo\\ C\z - zo\^. 


To conclude, we observe that Dt 
above inequalities to obtain 



= 



and we combine the 


^tz^tzo 




(c sup \\H/Hzo 

V ^' 66^0 


+ C sup ljAt,,Jl 

‘6 [0.1] , 


,2N 


z - Zol 


□ 


Asymptotics of the vanishing derivatives precertificate. We end this 
section devoted to the asymptotic behavior of quantities related to T tz by study¬ 
ing the second derivative of the vanishing derivatives precertificate r]v,t (see 
Definition 2, and [12] for more details). Theorem 1 ensures that the second 
derivatives of r]v,t do not vanish at zo,i, 1 ^ ^ A. However, it does not pro¬ 

vide any estimation of those second derivatives. That is the purpose of the next 
proposition. 

In view of Section 4, it will be useful to study those second derivatives not 
only for the precertificates that are defined by interpolating the sign at tzg but 
more generally for the precertificates that are defined to interpolate the sign at 
tz for any z G Bzq ■ 
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Proposition 8. Assume that tp € and that I 2 N -1 holds. Then 


n *T^+,* / 




f^N\ ,2N-2 I /n/+2Ar-l\ 

1 0 j ^ ^ ) , 


(30) 


where d, € 


oAf 


, d^. 

^y..i — 


(27V)! 


Y\{zi - Zjf for 1 ^ i ^ N. (31) 




Proof. We proceed as in the proof of Lemma 2 by writing = 'I' 2 Ar-iV'tz + 
(see (29)) and = TtzHf^. We obtain 

The first term yields 


j = (^"0'j = Vtz* 62 N = 0. (32) 

As for the second term, we take the Taylor expansion a little further (using 
integration by parts), 


-1 Tj-i,* I 


f p''‘^^\stZi) 
Jo 


(1-s) 


2N-3 


-ds = 


^ 2 N 


(2A^-3)! (2A^-2)! 

tz^ [ p 


(1 „\2N-2 

ds. 


{2N -2)\ 


so as to obtain 

At,z = {p 2 N, ■ • ■, T 2 n) diag(e^) + Oft ), where 


\2N-2 


d_et. f (Zi) 

[(m- 2 )! 


T,N 


l<i<N 


and, as usual, 0{t) is uniform in z € Bzq. From Lemma 1, we also know that 


-!+,♦ I Hat 

0 


A* r 

tz 


= pw + 0{t), hence 

{T2N,Pw)'h 


* r+-* ^ = diag(e,) 


+ O{t)=7^<^^'>{0)ez + O{t). (33) 


fP2N,Pw)u 

Now, we proceed with the last term. Similarly, by integration by parts, 
At,z = {p2 N ■ ■ • P 2 n) diag(/z) + 0(f). 
where fz is defined by 

'(Zl)"^ {ZN?^ 


fz — 


(2Af)! ’ ■ ■ ■ ’ (27V)! ’ (2Af - 1)! ’ ■ ■ ■ ’ {2N - 1)! 


i)2Ar 


(34) 
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Hence, 


A* r+’* - 

tz 


{V’2N,Pw)-h 

= diag(/^) I : | + 0 {t) = + 0{t). 

{V2N,Pw)u/ 


To conclude, we study Vz*H~^’* (which is uniformly bounded on Bz^)- In 
IR 2 Ar-i[-^] endowed with the basis ^1, X,, (fjv^rfyr) ; Ht Ihe matrix of the 
linear map which evaluates a polynomial and its derivatives at {zi,... ^Zn}- 
On the other hand 14* represents the evaluation of the second derivative at 
{zi,...,Zn)- Thus, 

Vz*H-^’*fz = 

where P is the unique polynomial in IR 2 Af-i[-^] which satishes 

( A2N 

V* = l,...,fV, = ^ and 


(2fV)! 


(2Af- 1)!' 


One may check that 




2N 


N 


i2Ny. (27V)! 


^2N-2 


and P''{zi) = 


{2N-2)\ (27V)! 


3¥=i 


As a result, 


-H/i?4'’*A*,r+/ (^Y) = - e.) + 0 {t ), 


(35) 


where 0{t) is uniform in 2 € Bzq- We obtain the claimed result by summing 
(32), (33) and (35). □ 


4 Building a Candidate Solution 

Now that the technical issues regarding the asymptotic behavior of T^^ have 
been settled, we are ready to tackle the study of the BLASSO. In this section, we 
build a candidate solution for Vx {yt + w) by relying on its optimality conditions. 

4.1 First Order Optimality Conditions 

The optimality conditions for Vx{yt + w) (see [12]) state that any measure 
of the form ma,tz = is a solution to Vxiyt + w) if and only if the 

function defined by px,t ”= ^*P\,t with px^t "= y (* 1 * 420^0 + w — ^tzCi) satishes 
||??A.t|lLoo(^) 4 1 and yx,t{tzi) = sign(ai) for all 1 4 7 4 TV. 


23 



Observe that we must have = 0 for all 1 ^ i ^ N. Moreover, in our 

case, since we assume that ao,i > 0 we have in fact ri\^t{tzi) = 1. 

In order to build such a function r]\^t, let us consider the function ft defined 
for some fixed t > 0 on x M x "H by 


ft{u, v) =' Tt^ {^tza - ^tzoao - w) + X ^ (36) 

where u=(a,z) and v = (X,w). (37) 

Now, let us write Uq '= (ao,Zo). Notice that rria^tz is a solution to V\{yt + w) 
if and only if ft{u,v) = 0 and ||?7A,i|lL“(x) ^ strategy is therefore to 

construct solutions of ft{u,v) = 0 and to prove that ^ 1 provided 

(A, w) and ijw satisfy certain properties. More precisely we start by parametriz¬ 
ing the solutions of /((m, v) = 0, in a neighborhood of (uq, 0), using the Implicit 
Function Theorem. 

The following Lemma 4 (whose proof is omitted and corresponds to simple 
computations) shows that ft is smooth and gives its derivatives. 

Lemma 4. If ip & for some k gW then ft is of class and for all 

iu,v) G X (M X -H) 


duft{u,v) 
dyft{u,v) 

, j drf. /ld]v 0 \ 

where Jta — I ^ / \ I • 

y 0 tdiag(a)y 

4.2 Implicit Function Theorem 

Suppose that I 2 N -1 holds and tp G By the results of Section 3, 

there exists 0 < to < 1 such that for 0 < t < to and all 2 G Bz^, Ft^Ttz is 
invertible. In the following we shall consider a fixed value of such to provided 
by Lemma 5 below which also ensures additional properties. 

Now, let t G (0,to) be fixed. By Lemma 4, ft is 9«/t('Uo,0) = 

^tzQ^tzoJtao is invertible and /t(uo,0) = 0. Hence by the Implicit Function 
Theorem, there exists Vj a neighborhood of 0 in M x "H, Ut a neighborhood of 
uo in s-ad gt '-Vt ^ Ut a. function such that 

y{u,v) €Ut xVt, ftiu,v)=0 u = gt{v). 

Moreover, denoting dgt the differential of gt, we have 

Vu G Vt, dgt{v) = - {duft{gt{v),v))~^ dyft{gt{v),v). 


fO diag(4>(*($tza-$tzoao-ic))\ 
1^0 diag($"*($tza - 4>izoao - w))y 
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4.3 Extension of the Implicit Function gt 

Our goal is to prove that ma,tz is the solution of the BLASSO, where (a, z) = 
u = gt{v). To this end, we shall exhibit additional constraints on € Vt, such 
as the scaling of the noise ||w||^ or A with respect to t, in order to ensure that 
^ 1- However, the size of the neighborhood Vt provided by the 
Implicit Function Theorem is a priori unknown, and it might implicitly impose 
even stronger conditions on A and w as t —>■ 0^. 

Hence, before studying whether ||? 7 A,i|lLcxj(j, 5 -) ^ 1, we show in this section 
that we may replace Vt with some ball with radius of order and still have 

a parametrization of the form u = gt{v) satisfying ft(gt(v),v) = 0 where ft is 
defined in (36). 

Let Vf = Uv 6 v where V is the collection of all open sets H C M x "H such 
that 

• 0 G y, 

• y is star-shaped with respect to 0 , 

• y C B(0, where Ct > 0 is a constant defined by Lemma 5 

below, 

• there exists a function g : V ^ such that 5 ( 0 ) = Ug and 

ft{g{v),v) = 0 for all v & V, 

• giy) c X 

Observe that V is nonempty (by the Implicit Function Theorem in Section 4.2) 
and stable by union, so that Vf G V. Indeed, all the properties defining V are 
easy to check except possibly the last two. Let y y G V and g, g be corre¬ 
sponding functions. The set |v G y H y ; g{v) = 5 (u)| is nonempty (because 

g{0) = ug = 5 ( 0 )) and closed in y n y. Moreover, it is open since for any 
V G y n y, V G B(0, and, by Lemma 5 below, the Implicit Function 

Theorem applies at {g{v),v), yielding an open neighborhood in which g and g 
coincide. By connectedness of y fl y, g and g coincide in the whole set y n y. 
As a result, the function g^ : Vf —)■ (M^)^, defined by 

gl{v) ‘^= g{v) if u G y, y G V, and g is a corresponding function, (38) 

is well defined. Moreover, gl is and gl{Vf ) C Ba^ x Bz^. 

Before proving that Vf contains a ball of radius of order and studying 

the variations of gl, we state Lemma 5 mentioned above. 

Lemma 5. Ifl 2 N-i holds and G then there exists tg G (0,1) and 

Ct > 0 (which only depend on (p and ug) such that for all t G (0,to)> */ 

u = (a, z) G Bag X Bzo and v = (A, w) G B(0, CTt ^^~^), (39) 


25 



then the matrix 


GUX, w) = (40) 

where (41) 

and qt:, = Ar:;+ + nt.w + , (42) 

is invertible and the norm of its inverse is less than 3 || (4'2jv_i^'2Af-i)~^ || • 

If, moreover, ft{u,v) = 0, then 

duft{u,v) = Hf,,GtziX,w)HtzJta 

and this is an invertible matrix. 

Let us precise that by {\,w) € B(0, , we mean that |A| < GTt^^~'^ 

and ||w||^ < GTt^^~‘^. 

Proof. We consider to G (0,1) small enough so that for 0 < t < tg and all 
Tl,,Ttz is invertible and 

||(^'t^^'t 2 )“^|| < 2 ||(^' 2 ^_i^' 2 Ar-i)“^|| by Lemma 1, (43) 

^ Proposition 8. (44) 

In the last equation, we have used the fact that {Zr-ZjY\ < {2Rwf^ 

(where Rw = sup{|z|^ ; z G Bz^} is defined in Equation (19)). 

We also know that for some constants Li,L 2 > 0 which only depend on (p 
and Mo, 

\^'ffntzw\^ ^ by Lemma 2, 

and ^'ffUtz^tzo ( ^ LiL 2 ^t'^^~^ by Lemma 2 and Lemma 3, 

for all z G Bzq = B(zo, ^), t G i0,to), where Aq is defined in (5). 

Combining those inequalities with (44), we see that 

= A<i>;';r:;+ (*") + ‘t":n,ew + (“") 

s iAi ^ |.fr’(o)| +ii«"ii« 

On the other hand, since 

= diag (1,..., Hl'-^ diag (1,..., 1, t,..., t), 

^‘^=(o -diag(%;fe))’ 


26 



we get 


T7 7—1 77—1 
^tz ^tzJta ^tz 


= tdiag(i....,^2^)i7;’ 


so that 




-1 rr-l 
tz'^ta ^tz 


< 


^iN-i 


ll^z'i Kzfc 


< c 


2N 


|A| 4(2i?) 
^ 2 N -2 (27V)! 


17^(0) 




An 


f2N-2 




withC = sup(„_,)gg g 1 


\H 


-i| 


,-i| 


Possibly choosing to a bit smaller, we may assume that 

0 < CL^L^^tl < —^ 

d 8 ||(lE'2Ar_ill'27V-l) ^1 


As a consequence, there exists Ct > 0 such that for all t € (0,to)) and all 

{ci, z) G Bag X Bzg, 


max 


f |A| 




n 


1 j-2N-2 ’ ^2N-2 


< Ct 




< 


4 ||(^ 2 JV - 1 ^ 2 Ar - l ) ^ 


Then, recalling (43) and setting 




r = 


< 


Jzltz ^tz’Jta 
\-ll 


HViz) 


1 

2 ’ 


we see that the matrix (40) is invertible, and 


(ld2N + t Hlf^FtzJCaH^,^) 


+ 00 


fc =0 



3 

2' 


Eventually, using (43) again, we obtain that the norm of the inverse of (40) 
is less than 3 ||(4'2jv_i'I'2Ar-i)“^||■ 

Now, if ft{u,v) = 0, then $(*($t 2 a — ^tzgC^o — w) = 0, so that thanks to 
Lemma 4 we obtain 


duMu,v) = rl^TtzJta + t 


0 

diag(4>"* (4*7^0 - 4>t2oao 
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Moreover, 


^tza - -w = Ttz j - w 

= ^tz ^tz^tzo ( 0 ] V 0 / ~ V 0 / ~ 

TT T' I ^0 \ IT XT'*’"!” I ^-^4 

^tz^tzQ I Q I ^^tz i 0 / ^tz- 


As a result, 

duMu, v) = Hl^l^tzHtzJta + t ( 0 - diag($-,,.) ) 
= HlGtz{,\w)HtzJta, 


and duft{u,v) is invertible. □ 

We may now study the variations of g^. 

Corollary 1. Ifl 2 N-i holds and ip G KER^^"*"^ then there exists M > 0 (which 
only depends on p and Uq), such that for 0 < t < tg, for all v G Vf 

Proof. Let us recall that by construction, Vf C B(0, • Thus, from 

Lemma 5, we know that for all v G Vf, duft{gt{v),v) = Hf^Gtz(,^,w)HtzJta, 
where (a, z) = g({v). Since Agl{v) = - {duft{gt{v),v))~^ dyftig({v),v), we get 


dgtiv) = -Jf(^Hf;^GUX,w)-^H:r^ 

= diag Gtz{X, {52 n, ^'D > 

Using Lemma 5 and the fact that J~^,H~^,^tz are uniformly bounded on 
Bao X ^zoj we obtain the claimed upper bound of ||d( 7 (*(v)|| for all v G Vf. □ 

We are now in position to prove that Vf contains a ball of radius of order 

^2N-l_ 

Proposition 9. Ifl 2 N-i holds and ip G there exists Gr > 0 such 

that for all t G (0, to), 


B{o, Gr^^-^) c v; 


with 


/^ min^(oo^i) Ct \ 
\4M' AM "to ) ' 


Proof. Let v gM. x H with unit norm (i.e. max(A, HicH^) = 1), and define 


Rv =’ sup {r ^ 0 ; rv GVf} . 
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Clearly 0 < ^ Assume that Ry < Then by Corollary 1, 

gf is uniformly continuous on V^*, so that the value of gl{Ryv) can be defined 
as a limit, and ft{gl{Rvv), Ryv) = 0. 

By contradiction, if gl{Ryv) € Bag x Bzg, then by Lemma 5, we may apply 
the Implicit Function Theorem to obtain a neighborhood of {g^{Ryv),Ry) in 
which g^ may be extended. This enables us to construct an open set V € V 
(in particular we may ensure that V is star-shaped with respect to 0) such that 
V-t £ which contradicts the maximality of V^*. 

Hence, gt{Ryv) G d{Bao x = {d{Bao) x B^g) U {Bag x d{Bzg)). Assume 
for instance that gl{Ryv) G Bag x d{Bzo) (the other case being similar). Then, 
for {a,z) = g^iRyv), 

A A4 

^ = l^-^oloo< l^atisRvv)-Ryv\^ds^ ^^^j^Ry, 

which yields Ry ^ Similarly, if gl{Ryv) G d(Bag) x Bzg, we may 

prove that Ry > 

Eventually, we have proved that for all G K x "H with unit norm. 


Ry ^ min 


Ao 2Ar-i niinj(ao,i) 


.2N-1 


4M 


4M 


Ct^ 


2N-2 


and the claimed result follows. 


□ 


4.4 Continuity of at 0 

Before moving to Section 5 and to the proof of ||^A.i|lL“(x) ^ ^ (which 
ensures that ma^tz is ^ solution to the BLASSO), we give a first order expansion 
of our candidate solution u = gt{v) for all v G B{0, 

Proposition 10. If I 2 N -1 holds and ip G then for all t G (0,to)) 

V G H(0,C'i^^2W-l), 

b:(z;) - g**(0)|| < M . (45) 

Proof. To show Equation (45), it suffices to write 

9t (v) = 9*t (0) + [ dgt {sv) ■ vds, 

Jo 

and use Corollary 1 to conclude. 

□ 


5 Convergence of rjxg to rjw 

It remains to prove that ma,tz where (a, z) = gl (v) is indeed a solution to 
RxiVt + w). To show this statement, we prove that g\^t converges towards gw 
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when {t, X,w) —?■ 0 in a well chosen domain. This section is devoted to this 
result. The proof of our main contribution, Theorem 2, which can be found in 
Section 2.3, uses this convergence result and the assumption that rjw is (2iV—1)- 
non-degenerate to conclude. 

Proposition 11. Assume that (p G KER^^'*"^ and that I 2 N -1 holds, and let 
Cw > 0 be the constant defined in Theorem 1, gl, to > 0 and Cr > 0 be the 
function and constants defined in Section 4- 

Then there exist constants ti € (0,to) (ind C > 0 (which depend only on (p 
and uoj such that for all t € (0,ti) and for all {X,w) G B(0, CRt^^~^) with 
llxll-x/ ^ C, the following inequalities hold 


VtG {0,...,2Af}, 


(^) 

Txt 


W 


L<»(X) 




w, 


with gx,t = ^* (i {^tzoao + w - $ 4 ^ 0 )) and (a,z) = g((X,w). 

Proof. Let t G ( 0 ,to )5 v G B(0, and {a,z) =u = gl(v). Then, using 

ft{u,v) = 0 (see (36)), we get 

PX,t =' ^ (^’tzo«0 + w - ^tzO) 

^ X (0°) + (0)) 

— (^N ^iTT ^i^TT p 


Hence, 


Ibv.t 


PwWu ^ 



- Pw 


n 



w 

A 


n 


1 . 




tZ-^ tZQ 



n 


From Lemma 1, there exists Cy > 0 and ty > 0 (which only depends on p,Uq) 
such that for all t G (0, ty) and all 2 ; G Bz^, 


)~l“ 


Iat 

0 


-Pw 


^ Cyt. 
n 


Moreover, since litz is an orthogonal projector. 


n* 




H 


H 


and by Lemma 3, there exists L 2 > 0 (which only depends on <p,Uo) such that, 
for all t G (0, to) 


1 

X 


fltzl^izo 



< \z-Zo[ 

n 1^1 

-^2 ,2N ^ 


^ |y|^ + ll^llw) by Proposition 10 


< L2Mt (^1 + 
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Gathering all these upper-bounds, one obtains 


\\p\,t ~ PwW-f^ ^ (Cv + L2M)t -|- (1 -l- L2Mt) 

^ {Cv + L2M)t -l- (1 -l- L 2 M) 


w 


H 


H 


Now, denoting by : M.{X) —> % the operator m ^ {x)dm{x) and 

by : Pi —<rfx its adjoint (so that and we 

let 


^ . det. 

K = max sup 
0^K2Nx^X 


(x) 


n 


W W 

Vx,t - Vw 


L“(X) 


which satisfies K < -l-oo because (p G Then, for all f G {0,..., 2N}, 

< K ||pA,t -PwWfi 

^ K -|- L2M)t -f (1 -|- L 2 M) — ^ . 

As a consequence, by taking t smaller than min(to, W, 2 K{cp+L 2 M) ) 

{X,w) G such that 

I w 

A 


(1 -f L 2 M) 


< 

« " 2 X’ 


we get 


W W 

Vx't - Vw 


L“(X) 


< Cw- 


Remark 4. The constants involved in Proposition 11 are 

det. . , Cw , J d't- 

*' = C = 2K{1 + Lwy 

They only depend on p and Mq. 


□ 


(46) 


6 Conclusion 

In this paper, we have proposed a detailed analysis of the recovery per¬ 
formance of the BLASSO for positive measures. We have shown that if the 
signal-to-noise ratio is of the order of then the BLASSO achieves per¬ 

fect support estimation. This results nicely matches both Cramer-Rao lower- 
bounds [2], bounds for combinatorial approaches [10] and practical performances 
of the MUSIC algorithm [7]. It is also close to the upper-bound obtained by [19] 
for stable recovery on a grid by the LASSO. We have hence shown that these 
bounds do not only imply stability, they imply exact support recovery for the 
BLASSO, if r]w is (2A^ — I)-non-degenerate. We have observed numerically that 
this condition holds for the ideal low-pass filter and that this is the case for a 
large class of low-pass filters. We have showed that this is true for the Gaussian 
kernel by providing the expression of r]w- 
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A Proof of the Results of Section 2 

A.l Proof of Proposition 1 

The functions (ipo,... ,ipk) are linearly independent in L^(T) if and only 
if their respective Eourier coefficients are linearly independent in £^(Z). If 
(cn[</3o])nez denotes the Fourier coefficients of (po, the Fourier coefficients of 
(fij are given by ((2i7rn)'^c„[v3o])„gg (with the convention that 0° = 1). 

If ipo has fc -|- 1 nonzeros Fourier coefficients corresponding to pairwise dis¬ 
tinct frequencies (no,.. .n^), those Fourier coefficients are given by the matrix 
product 


/ C«o [</5o] 

■ ”1 

A 

1 

(2i7rno) 

(2i7rni) 

(2i7rno)^ . 

(2i7rni)^ 

. (2i7rno)'' \ 

. (2i7rni))'= 

V 0 

CndVo]/ 


(2i7rnfc) 

(2i7rnfc)^ . 

. (2i7rnfc)'= / 


Both the diagonal and the Vandermonde matrices are invertible, hence the fam¬ 
ily of Fourier coefficients of ((/?o, • ■ •, Pk) is linearly independent. 

Conversely, ifXfe holds, one can hnd k + 1 Fourier coefficients, corresponding 
to some frequencies uq, .. .Uk, such that the matrix (c„^ i® invertible. 

From the above factorization, we deduce that each [t^o] must be nonzero for 

o^e^k. 

A.2 Proof of Theorem 1 

The proof proceeds in two steps. First we show the result locally around 0 
in X thanks to < 0 (because ^ 0 and \'qw\ < 1 on AT \ {0}) 

and then we extend the result to all X thanks to \'qw\ < 1 on V \ {0}. 

Locally. Let us prove that there exist Cw > 0, tw > 0 such that for all 
t € z G with pairwise disctinct coordinates and \z\^ < Rw, there 

exist r+ > 0 with r'^ > max twZi and r“ < 0 with r~ < min twZi such that 

l^i^N l^i^N 

for all r] G 'rf‘^^{X) C W^^’°°(X) satisfying for all 1 < i < V, ri{tzi) = 1 and 
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ri'{tzi) = 0, the following implication holds, 




- Vw 


L~(X) 


^ a 


w 


G (r \ [J{tZt}, \r]{x)\ < 1 and VI < i < TV, ri''{tzi) < 0^ . 


First, we prove that, provided Cw > 0, > 0 are small enough, 77' has 

exactly 2N — 1 zeros in (r“,r“''). 

Let t > 0 and 77 G n and z G with pairwise distinct 

coordinates and \z\^ ^ Rw such that for all 1 < t < A^, ri{tzi) = 1 and 
ri'{tzi) = 0. We suppose that Zi < Z 2 < ■ ■ ■ < z^. By Rolle’s Theorem, for all 
1 < 7 < iV — 1, there exists Ci{t) G {tzi,tzi+i) such that ri'{ci{t)) = 0. As a 
result ry' has at least 2N — 1 zeros in all {r~,r~^) satisfying the requirements. 

Now, let us assume by contradiction that rj' has strictly more than 2N — 1 
zeros for arbitrarily small values of Cw, tw and in all (r~,r+) satisfying the 
requirements. As a result, there are sequences (tfe)fc6N where tk —>■ 0, (zfe)fc6N 
(where each Zk G has pairwise distinct coordinates and \zk\^ < Rw), 
and (r^)fc6N where for all /c G N, > ^^J-k{.Zk)i, -> 0 

and r7 < 0, rZ < min tk(zk)i and rZ —)• 0. And there exists ir]k)ken V 
n W^^’“(A))” such that for all fc G N 
Vi G {!,...,iV-1}, r7fc(4(zfe),) = 1 and T]'^{tkizk)i) = 0, 


V£g {0,...,27V}, 




Vk 

Vw 


1 

L°°(JC) k 


and ?7(, has at least 2N zeros in {vf. , r'jz ) (we already know that ry^, has at least 
2N — 1 zeros in (t'^,?’^)). Thus, for all fc G N, by applying successively Rolle’s 
Theorem, we obtain that there exists Xk G (?'fc ,r^) such that rjl^^Hxk) = 0. 
Since Xfc —>■ 0 (because r'j^ ,r^ —>■ 0) and 

that = 0, which is a contradiction. Hence ry' has exactly 2N — 1 zeros 

in some (r“, r’*'). 

Using the same argument, we may also prove that for all i G {!,..., N}, 
ri''{tzi) 7^ 0. Let us now observe that, either for all 1 < i < A^, ri"{tzi) > 0 or, 
for all 1 ^ i < Af, ry"(tzi+i) < 0. Indeed, assume for instance by contradiction 
that for some 1 ^ i ^ N — 1, ri"{tzi) > 0 and ry"(tzi+i) < 0. Then, there 
exists Ci{t) G {tzi,tzi+i) such that ri{ci{t)) = 1. Applying Rolle’s Theorem 
on respectively (tZi,Ci{t)) and (ci(t),tZj+i), we obtain that ry' vanishes at least 
twice in {tZi, tzi+i). It is a contradiction with the fact that ry' has exactly 2N — I 
zeros in (r^jr"*") for all 0 < t < trv- 

As a result, there are only two possibilities: either r]"{tZi) < 0 for all i (and 
then for all x G (r“,r+) \ [Ji{tzi} ri{x) < I), or ri''{tzi) > 0 for all i (and then 
for all X G (r“,r+) \ [j^{tzi} ri{x) > I). But since ry^^^(O) < 0, there is some 


A2N) 

A-zn) 

Vk 

Vw 




^ j:, we deduce 
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X e (r“,r+) and some £o > 0 such that r]w(x) < 1 — eo- Choosing Cw small 
enough so that Cw < £o/2, we obtain that 'q{x) < 1 — eo/2. As a consequence, 
for all {r~ ,r~^)\[Ji{tZi}, r]{x) < 1. Finally we can suppose that ? 7 i^(a;) > —1 
on (r“, r+) by imposing 0 < Cw < infx Vw + 1- 
To sum up, we have proved the following : 

Vx G {r~\ |? 7 (x)| < 1 and Wl ^ i ^ N, rj'[tZi) < 0. (47) 


Globally. As rjw is non-degenerate, we have supx\(r-,r+) l''7w| < 1- We can 
assume that 0 < Cw < (l — sup^^^^-,,+ ) |? 7 w |)/2 and use ||?7 — ?7w|Il°°(jc) ^ 
so as to obtain 


yxGX\{r ,r+), \r]{x)\ < 1. (48) 

Gathering Equations (47) and (48), we obtain the claimed result. 

A.3 Proof of Proposition 5 

The proof of Proposition 5 relies on the study of the structure of the matrices 
We start by introducing the notion of checkerboard matrices. 

Definition 4. Let n G N* and A = G We say that A is a 

checkerboard matrix if for all {i,j) such that i + j is odd, Oij = 0. 


For an odd n 

, such a 

matrix looks like 




( 01,1 

0 

Ol,3 

0 

0 

Ol,n-2 

0 


0 

02,2 

0 

02,4 

02,n-3 

0 

02,n-l 

0 

0 

an-1,2 

0 

On-1,4 

• ■ On—l,n—3 

0 

On—l,n—; 

L 0 

1 

0 

On,3 

0 

0 

On,n—2 

0 

^n,n / 


Lemma 6. The set of checkerboard matrices of size n G N* is an algebra over 
M. As a result the inverse of a checkerboard matrix is also a checkerboard 
matrix. 

Proof. The only difficulty is to show that the product of two checkerboard ma¬ 
trices is also a checkerboard matrix. Let A = (aij)ij and B = (bij)ij be two 
such matrices. Let (i,j) such that i+j is odd. Then X]fe=i <^i,kbk,j = 0 because 
if f -I- fc is even then (i + j) — (i + k) = j — k is odd and j — k +2k = j + k 
is odd, hence bj^k = 0. On the contrary, if i -|- fc is odd then Oi^k = 0. So AB 
is a checkerboard matrix. The last statement holds because the inverse of any 
matrix is a polynomial in that matrix. □ 

Now we give a more precise result on the structure of some particular checker¬ 
board matrices. 
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Lemma 7. Let A = (aij)i^ij^ 2 n+i be a symmetric positive-definite checker¬ 
board matrix of size 2 n + 1 such that 

Qi , = (—l)^Oi+j i+j 
2 ’ 2 

for all i,j G {l,...,2n+ 1} such that i-\- j is even, and let 
denote the entries of A~^. Then & 2 n+i, 2 n+i and 5 i, 2 n+i are positive. 

Proof. The fact that 62 n+i, 2 n+i is positive, is a direct consequence of the fact 
that A~^ is symmetric positive-definite because so is A. 

From the expression of the inverse of a matrix using cofactors, we see that 
^i, 2 n-i-i = det(Ai_ 2 n+i)/det(A), where Ai^ 2 n+i is the matrix obtained from A 
by removing the first row and the last column. Since A is symmetric positive- 
definite, det(A) > 0, and we need only show that det(Ai^ 2 n+i) > 0. But one 
can see that ^i, 2 n+i is a skew symmetric matrix of size 2n, hence 

det(Ai 2n+l) = pf(24-l,2n-|-l)^, 

where pf(Ai^ 2 n-i-i) is the PfafRan of the matrix Ai^ 2 n+i- For more details on 
the Pfaffian (definition and proof of the result used here), see [16]. As a result, 
det(Ai^ 2 n+i) ^ 0 . 

Moreover, Ai_ 2 n-i-i is invertible. Indeed if we denote the colums of A (resp. 
the columns Ai 2 n+i) by Ci,..., C' 2 „+i (resp. Ci,..., C' 2 n), we observe that for 
0 ^ i ^ n — 1, C 2 i+i G El and C' 2 (i+i) € E 2 where 

El = Span{e 2 j+i; 0 < j < n - 1 }, 

E 2 = Span{e 2 (j+i); 0 < j < n - 1 }, 

with (ei)i^i^ 2 n is the canonical base of Since Ei and E 2 are in direct sum, if 
Ai, 2 n+i is not invertible it means that either (C 2 j+i)o^i^n-i or (C' 2 (i+i))o^i^n-i 
is linearly dependent. But it cannot be (C' 2 (i+i))o^i^n-i because it would imply 
that (C' 2 (i+i))o^i^n-i is also linearly dependent, because 

j • 

which would contradict the invertibility of A. 

So it means that (C' 2 i+i)o^i^n-i must be linearly dependent. However from 
the structure of the matrix A, one can see that for all 0 ^ z ^ n — 1 



( \ 


( (—l)*~'’^a2i+l,l ^ 


(—l)*~'’^a2i+i,i 


0 


0 


— (—l)*+^a2i+3,i 


— (—l)*+^a2i+3,i 


0 

^21+1 — 

0 

and C' 2 (i+i) = 





0 


0 


— (—l)*+^a2(i+„)+i,i 


\~(~l)*''’^02(i-|-n) + l,l/ 


'v 0 / 
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Thus, a linear combination between the elements of ((721+1 gives the 

same linear combination between the elements of ((72(i+i))o^i^n-i5 which con¬ 
tradicts for the same reason as before the invertibility of A. 

Hence Hi_ 2 n+i is invertible and bl,2n+l >0. □ 

The next result describes the structure of the matrices when $ is a 

convolution operator (be. for all a; € M, (p(x) = (p{- — x)) and fc € N* is odd. 

Lemma 8. Suppose that ^ is a convolution operator. Then for all k G N*, 
is a checkerboard matrix. Moreover ifl 2 N holds, then the entries indexed 
by (1, 2N + 1) and {2N + 1, 2N + 1) of (5'2jv^2Af)~^ are positive. 

Proof. Let {i,j) such that i+j is odd and for example i > j ('£'^'1'^ is symmetric 
so it does not matter). The entry {i,j) of ^'^'1'^ is equal to 

(V^i-l) ■ 

Recall that in the setting of Proposition 5, TL = Lf{X) and {f,g)-^ = fh- 
Successive integrations by parts (we integrate the left term and derive the right 
term) yield 

'Pj-i)^ = (~1) ^ \Ti±jfLk-ijTi±4fLk-i/ j 

(ipi-i,ipj-i)^ = (—1) ^ ■ 

Hence, by the symmetry of the scalar product, we obtain that 

(^i —1) — ~ Ij — ■ 

This implies that {ipi-i,ipj-x).^ = 0 for odd values of i + j, so that is a 

checkerboard matrix (see Definition 4). 

Moreover, similar computations show that when i + j is even 

((p,_i,(Pj_i)„ = (-1)^ 

If T 2 N holds, 'I' 2 jvd^ 2 Ar is symmetric positive-definite. As a result 'I' 2 jv'I' 2 Af 
satishes the assumptions of Lemma 7, so the entries (1,2A^ -I- 1) and (27V -|- 
1,2A^-|- 1) of ('I' 27 v^ 21 v)~^ are positive. □ 

We can now prove that (0) < 0 when $ is a convolution operator and 
I 2 N holds. 


Proof of Proposition 5. Since rjw = ^*Pw, we deduce that 

= {+2N,Pw)-h = (V’27V,'E'2Af-l(^'2Ar-l'I'2Ar-l)~^(52Ar).j^ . 
Consider the symmetric positive definite matrix 


«'2Ar«'27V 


f^2N-1^2N-l ^2 N-iT2n\ m2Nx2N 

\mM-lT 2 N]* \\T2N\\n J 
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Observe that 'I' 2 Ar_i^' 27 V-i is invertible, and that 

S = ||'^2Af||^ — (<^52^,'I'2Ar-l('I'2Ar-l'i^2JV-l) 7^ 0. 

Indeed, 


s — ||(Id« — n2Ar_i)(/?27v||^ > 0 


since {(pQ,... (/? 27 v) has full rank. 

Thus, we may apply the block inversion formula, and (4'2jv5'27v)~^ is of the 
form 


(«'2Ar^'2Ar)”^ = 


1 


||(Idw — n2Ar_i)(/32Ar|| 


n 


* * * 

-[(^2iV-1^2JV-l)-^d'2Ar 


-iP2n]* 


“(d^2JV-1^2JV-l) ^'i'2N-l‘P2N 
1 


Lemma 8 ensures that the entry (2fV + l, 1) of (5'2jvd^2Ar) ^ is (strictly) positive. 
This precisely means that 


{(f2N,'^2N-l{'^2N-l'^2N-l) ^ 

||(Idw — ^2N-i)P2n\\^ 
and as a result = {p 2 N,Pw)'h < 0- 


□ 


A.4 Proof of Proposition 6 

Let denote (afc)o^fc^ 2 Af-i the coefficients of the first column of the matrix 
(d^ 2 Ar-id^ 2 Af-i)~^- Then we know that for all x € M, 

N-l 

r]w{x) = ^ a2kp-^V^‘^^\x). 

Since (p-kif = x ^ we get that for all x G M, 

N-l 

r]w{x) = V7re“®^/'‘ ^ a 2 kH 2 k{x), (49) 

k=0 


where H 2 k is the Hermite polynomial of order 2k associated to a; —>■ e ^ Z^. 

Now let us show (18) recursively on iV € N*. If TV = 1, we have that 
Vwix) = e“® because 'I' 2 ^_^'I' 2 Ar-i = (v^)- Suppose that the property is 
true for some TV G N*, i.e. , 


Vw,n{x) 


N-l 

k=0 


22fcfc!’ 


(50) 
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where we use the notation to recall that this is the function riw for N 

spikes. Then form for all a; € M, 

k{x) =' e® {riw,N+i{x) - riw,N{x )). 

At is a polynomial of degree 2N (thanks to (49)), it satisfies k( 0) = 0 and for all 
1 < i < 2N — 1, At(®)(0) =0. As a result for all a: G M, 

k(x) = Ax^^, 


for some A G M. 

It remains to show that A = Remark that for all x G M, k(^^^(x) = 

A(2N)! and on the other hand At(^^^(0) = —vly^(O) because = 0 by 

definition of r/w,N+i- So it is enough to show that, 


Vw!n (0) 


(2N)l 

2‘^^Nl' 


(51) 


Thanks to the Leibniz formula applied to (50), we have the following formula, 


- ^\2h )^2Ar-2fc(0)22fc. ,• 
fe=0 ^ ^ 

Since i?2Ar-2fc(0) = ’ thanks to Lemma 9 below, we 

obtain. 



N-l 


V my- . ..N-k (2jv-2k-i)! 

^ (2N-2k)!(2k)! ^ ’ 22^-2fc-i(Ar-1 - fc)! 


N-l 


(2N)! N(N-l)l 2 •(-1)^-1-'= 

'22^ IV! ^ (N-l- k)!k! ’ (2N - 2k) 


(2N)l 

'22^iV! 


N-l 

k—O 


N-l 


(- 1 ) 


AT-l-fc 


„Ar-i-fc 


dx 


(2iV)! 


22Nn\ 


-N f (l-x)^-Mx. 
• Jo 


= 1/Ar 


(2fc)! 

22fcfc! 


This ends the recursive proof and thus we have (18). 

To conclude that rjw is (27V — l)-non-degenerate, it remains to show that 
for all X G M*, \riw(x)\ < 1, since we already know that ri^^\o) < 0 thanks 
to (51). But we have, thanks to (18), that for all x G M*, 0 < r]w{x) < 1 since 
J2k=o ts the truncated power series of 
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Lemma 9. One has for all k gN, 


*2^2(0) = (52) 

Proof. We know that Hk+ 2 {x) = xHk+i{x) — {k + l)Hk{x) where Hk is the 
Hermite polynomial of order k associated to x (->■ Thus -ff2fc+2(0) = 

— {2k + l)H 2 k{ 0 ) and then, 

fl2W2(0) = (-l)‘+'l?|j^^ (53) 

Now, remark that Hk{x) = 2~^^'^Hk{xl\f2), so that together with (53) we get 

the expected result (52). □ 


A.5 Proof of Theorem 3 

For all n e N, the solution m„ '*= ma„,t„z„ of 'Px^{yt^ + Wn) satisfies the 
following first order optimality conditions: 

rL.„ - $t.oao - Wn) + A„ = 0. (54) 


Since rjx^^tn = ^*(>^(^izoao + ic„ - $t„z„a„)) satisfies ||?7A„,t„|| l~(x) ^ 
e Im$* and T]x„,t„{tZn) = sign(a„) = 1. 

As a result, by taking n large enough if necessary, we know that (a„, Zn) = 
gf(An,Wn) with gl G ‘^‘^^{Vf‘) so that (a„, 2 :„) —>■ (ao,2:o)- Therefore all the 
asymptotic results that we established are true when applied for (a„, 2 „) for n 
large enough. Observe from (54), that we get for all n € N: 




= r: 


♦,+ 






As a consequence: 


lbA„,i„ - Prvll-H 0 when n+oo, 
so that for all 0 ^ ^ ^ 2N, 


(0 


(0 

Vw 


In particular, since lbA,t|lL°°(JC) 


—> 0 when n -G +oo. 

L“(X) 

< 1, we deduce (21): 


IIwIIl“(x) ^ 1- 


(55) 
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